Translating Compounds by Learning Component Gloss Translation Models via Multiple Languages

نویسندگان

  • Nikesh Garera
  • David Yarowsky
چکیده

This paper presents an approach to the translation of compound words without the need for bilingual training text, by modeling the mapping of literal component word glosses (e.g. “iron-path”) into fluent English (e.g. “railway”) across multiple languages. Performance is improved by adding component-sequence and learnedmorphology models along with context similarity from monolingual text and optional combination with traditional bilingual-textbased translation discovery.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Machine Translation for Languages Lacking Bitext via Multilingual Gloss Transduction

We propose and evaluate a new paradigm for machine translation of low resource languages via the learned surface transduction and paraphrase of multilingual glosses.

متن کامل

Polyglot Semantic Parsing in APIs

Traditional approaches to semantic parsing (SP) work by training individual models for each available parallel dataset of text-meaning pairs. In this paper, we explore the idea of polyglot semantic translation, or learning semantic parsing models that are trained on multiple datasets and natural languages. In particular, we focus on translating text to code signature representations using the s...

متن کامل

Multilingual eXtended WordNet Knowledge Base: Semantic Parsing and Translation of Glosses

This paper presents a method to create WordNet-like lexical resources for different languages. Instead of directly translating glosses from one language to another, we perform first semantic parsing of WordNet glosses and then translate the resulting semantic representation. The proposed approach simplifies the machine translation of the glosses. The approach provides ready to use semantic repr...

متن کامل

On The Effects of Literal Translation, L1 Glosses and Context, Applied in Reading Comprehension, on Iranian EFL Learners Vocabulary Learning: The Case of Different Proficiencies

This paper aims at discovering and investigating the effect of l1 translation, l1 gloss andcontext on vocabulary learning on EFL Iranian learners. A total number of 120 EFL students inprivate English institutes in Sari participated in the present study. They were divided into twoproficiency groups and three learning conditions. In order to make a list of words unknown to thelearners the partici...

متن کامل

Word Representation Models for Morphologically Rich Languages in Neural Machine Translation

Dealing with the co mplex word forms in morphologically rich languages is an open problem in language processing, and is particularly important in translation. In contrast to most modern neural systems of translation, which discard the identity for rare words, in this paper we propose several architectures for learning word representations from character and morpheme level word decompositions. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008